Predicting robustness against transient faults of MPI based programs

نویسندگان

  • Joao Gramacho
  • Alvaro Wong
  • Dolores Rexachs
  • Emilio Luque
چکیده

The evaluation of a program’s behavior in the presence of transient faults is often a very time consuming work. In order to achieve significant data, thousands of executions are normally required and each execution will have the significant overhead of the fault injection environment. Our previously published methodology reduced significantly the time needed to evaluate the robustness of a program execution by exhaustively analyzing its basic blocks trace instead of using fault injection. In this paper we present an even forward improvement in the evaluation time of parallel programs robustness against transient faults by combining our methodology with PAS2P – a method that strives to describe an application based on its messagepassing activity. The combination of our approach and PAS2P allowed us to predict the robustness of larger parallel programs, reducing in some cases in more than 20 times the time needed to calculate the robustness while obtaining a robustness prediction error of less than 4%. Transient faults, robustness, soft errors, reliability, PAS2P

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparison of transient ischemic dilation ratios in SPECT and SPECT-CT myocardial perfusion imaging in the low pre-test probability group

Introduction: The main purpose of this study was to compare transient ischemic dilation (TID) ratios in SPECT-low dose CT and SPECT Myocardial Perfusion Imaging (MPI) by application of different quantitative programs and quantify the possible shift in the upper normal limits of TID ratio in the SPECT-CT MPI. Methods: 149 Patients with low pre-test probability for coronary artery disease (CAD),...

متن کامل

A Methodology to Calculate a Program’s Robustness against Transient Faults

Computer chips implementation technologies are evolving to obtain more performance. The side effect of such a scenario is that processors are less robust than ever against transient faults. As on-chip solutions are expensive or tend to degrade processor performance, the efforts to deal with these transient faults in higher levels are increasing. Software based fault tolerance approaches against...

متن کامل

Debugging Tool for Localizing Faulty Processes in Message Passing Programs

In message passing programs, once a process terminates with an unexpected error, the terminated process can propagate the error to the rest of processes through communication dependencies, resulting in a program failure. Therefore, to locate faults, developers must identify the group of processes involved in the original error and faulty processes that activate faults. This paper presents a nov...

متن کامل

Robust Model- Based Fault Detection and Isolation for V47/660kW Wind Turbine

In this paper, in order to increase the efficiency, to reduce the cost and to prevent the failures of wind turbines, which lead to an extensive break down, a robust fault diagnosis system is proposed for V47/660kW wind turbine operated in Manjil wind farm, Gilan province, Iran. According to the acquired data from Iran wind turbine industry, common faults of the wind turbine such as sensor fault...

متن کامل

Comparison of transient ischemic dilation ratios in SPECT and SPECT-CT myocardial perfusion imaging in the low pre-test probability group

Introduction: The main purpose of this study was to compare transient ischemic dilation (TID) ratios in SPECT-low dose CT and SPECT Myocardial Perfusion Imaging (MPI) by application of different quantitative programs and quantify the possible shift in the upper normal limits of TID ratio in the SPECT-CT MPI. Methods: 149 Patients with low pre-test probability for coronary artery disease (CAD), ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • IJCSE

دوره 12  شماره 

صفحات  -

تاریخ انتشار 2016